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Commonalities and differences in correlation analysis in terms of phase space, 
conditioning and uncorrelatedness are discussed. The Poisson process is not gen- 
erally appropriate as reference distribution for normalisation and cumulants, so 
that generalised statistics in terms of arbitrarily defined reference processes must 
be employed. Consideration of the sampling hierarchy leads us to a classification 
of current event-by-event observables. 



1 Introduction 

In theory, knowledge of the fully differential probability distribution at all 
points contains everything there is to know about a given reaction. While 
this statement may be true, it is meaningless in practice because experimental 
samples are invariably too small to access even a fraction of the required 
information; also, the dimension of phase space is large, creating problems of 
projection and visualisation. 

In this situation, multiparticlc correlations provide meaningful answers 
through inclusive sampling: projectiles A and B collide to yield N final-state 
particles^, of which the observer considers only q particles at a time, A+B — ► 
Pi +£>2 + • • • +Pq + X, while ignoring the rest X by means of final-state phase 
space integration. In the simplest case q=0, one considers the multiplicity and 
the size of the available phase spaced advancing to one-particle distributions 
(5=1), two-particle correlations (q=2) and so forth. Every step on the q- 
ladder presents new and increasingly subtle challenges, both physicswise and 
in statistical sophistication. This review hopes to highlight both the simplicity 
and unity that a rigorous mathematical approach has taught us. 

2 Phase space and conditioning 
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Data to be analysed typically consists of a sample S of events e = 1, 2, . . . , M e 
each with a different multiplicity N e of tracks, each of which has momentum 
and other measured quantum numbers contained in a vector Xf, i = 1, . . . N e . 



* Proceedings of the 30th International Symposium on Multiparticle Dynamics, Tihany, 
Hungary, October 2000, ed. T. Csorgo and W. Kittel, World Scientific (to be published). 
a We shall pretend that multiparticle final states consist of one particle species only. 



For q-th order correlation measurement, the sample is completely described 
by the set of per- event counter densities, one for each event |] 

N 

p q (x!,...,x q )= J2 fi. X tv X ^---^t)5{x 1 -Xt 1 )---5{x q -Xt i ), (1) 

which registers a value fi(xi, . . . , x q ) whenever the coordinates (Xf ± , . . . , Xf ) 
of an ordered data q-tuple exactly match the observation points (xi, . . . , x q ). 
Some simple examples for p q and / are the all-charge rapidity counter pi (y) = 

J2iLi$(y - Vi) witn / = !> and ' using a 4-dimensional = (y l ,(t>i,p±i,c i ) 
with charges Ci — ±1, a second-order positives-only counter p2(xf,xj) = 
J2i=ij <5 Ci ,i<5 Cj ,i <5(xi - X.f ) <5(x 2 - X|) or a charge-charge counter /3 2 (xi, x 2 ) = 
E^CiC^Xi-XfJ^xa-X!). 

Essentially all correlation measures can be represented as combinations 
of the two kinds of averages over p q (xi, . . . , x q ) shown in Figure 1: 

1. "spatial" averaging, in the form of phase space integrals 

/ fi dx\ dx2 ■ ■ ■ dx q of the arguments x over a selected region of phase 
space fi, and 

2. sample averaging ()s — (1/A/" CV ) J2 e over a sam ple S. 
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Figure 1. Phase space and event averaging of the per-event counter p q yields, respec- 
tively, the per-event factorial moment ivM = N(N—1) ■ ■ ■ (N—g+1), the correlation density 
p q (xi, . . . ,x q ) and the sample factorial moment (N™) for a particular choice of fl and S. 

Along with the choice of variable x and weight function /, the choice of phase 
space 51 and sample S over which to average is motivated by the physics 
under investigation^} Choice of fl concerns particle selection; choice of S, 
called conditioning, concerns event selection. 

^Notations * and e shall be omitted and used interchangeably as is opportune. 
Experimental limitations such as acceptance and limited statistics also play a role. 
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Figure 2. Phase space integration domains Q (shaded areas) for various stages of box in- 
tcrmittency analysis. Each point represents a particle 2-tuple (pair) of the event whose 
tracks are represented as dots on the line below the phase space plot. Also shown is the 
corresponding increasingly fine binning of the event. 



We focus first on 17. In Figure 2, we show, for the old box-moment intermit- 
tency analysis, the appropriate O's as shaded areas, for various resolutions 
L/2- 7 , superimposed on an example counter p\{x\,X2), each of whose dots 
represents a pair drawn from the N = 8 tracks shown on the line below the 
plot itself. Further examples of ft for p 2 (2/1,2/2) for fixed y\ and the differ- 
ential correlation integral □ over the same counter are shown in Figs. 3a and 
3b respectively; greyscales represent the O's for successive data points. The 
correlation density p q (x\, . . . , x q ) = (p q ), which for q = 2 is conveniently rep- 
resented as a two-dimensional contour plot, is shown schematically in Figs. 
3c and 3d for rapidity y, together with O's for forward-backward and rapidity 
gap measurements. 




Figure 3. Examples of phase space integrals over Q. From left to right: (a) box correlations 
keeping rapidity j/i fixed; (b) correlation integral with linear binning in \xj —X2\; (c) forward- 
backward correlations for fixed bin widths but varying bin-bin separation; (d) rapidity gap 
correlations. 
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There is freedom of choice in sample averaging also. "Conditioning" is 
the choosing of a subsample of events from the available full sample ac- 



selections, and kaon-to-pion ratio. 

The influence of conditioning appears to be obvious and quite limited, 
in that different (sub)samples will clearly yield different results, and so little 
attention has been paid to it. This is deceptive, however: conditioning, or 
rather its absence, directly enters the foundations of correlation analysis as a 
hidden assumption and is therefore of central importance. 

3 Uncorrelatedness 

Experimental correlation analysis needs to be both measurable and mean- 
ingful. It is measurable only as far as trivial contributions to the correlated 
signal can be successfully eliminated; it is meaningful only to the extent that 
implicit assumptions are understood and fulfilled. Elimination of trivialities 
is accomplished in part by cumulants and appropriate normalisation, while 
consideration of an assumption underlying uncorrelatedness, the absence of 
correlation, will lead us to the "reference distribution" . 

3. 1 Cumulants 

Moments N q and factorial moments N^ q \ whether per-event or sample- 
averaged, contain correlations of all orders < q and so are very bad indi- 
cators of correlation of order q itself. These trivial lower-order contributions 
are removed by the use of cumulants, which are thus generally preferable. d 
Important for our purposes are two well-known cumulant properties: 

• Behaviour under statistical independence: A differential cumu- 
lant (e.g. Ci(x\,X2) — P2(xi,x 2 ) — pi(xi)pi(x 2 )) becomes zero whenever 
any one of its q arguments is statistically independent of (uncorrelated 
with respect to) the others, and the integrated factorial cumulant (e.g. 
(N(N—1)) — (N) 2 ) is zero when the distribution in f2 is poissonian: 



d There are, to my mind, only two justifications for use of moments: the first lies in possible 
scaling behavior, N q (Q) ~ Q^i with constant f q ; the second is that moments can be defined 
on a per-event basis, which cumulants generally cannot. 




C q {x u ...,X q ) 



statistical independence 
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I z~i / \ i 7 Poissoniaii in / n \ 

/ C q (xi, . . . , x q ) ax i ■ ■ ■ ax q *■ 0. (3) 

Jn 

• Additivity: If two distributions Pf and Ph are independent, the cumu- 
lant of the convolved distribution is the arithmetic sum of the individual 
cumulants, 

C {f*h) = C (f) + C {h) (4) 

There is a problem, however: While eq. (H) is true and seems simple enough, 
it assumes implicitly that the sample S (i) is inclusive, and (ii) has an over- 
all poissonian multiplicity distribution P^ = e~ v v N / 'Nl. A sample for which 
these assumptions are not fulfilled (i.e. almost every conditioned sample!) will 
hence not yield zero cumulants, even when it is otherwise considered "uncor- 
related" by the naive user. Indeed, it has long been a source of consternation 
that for fixed- multiplicity samples, Pfj — 8$ N , the cumulant does not in- 
tegrate to zero, J dx\ dx 2 C 2 = / dx\ dx 2 {p2 - Pi) = N(N—1) — N 2 = —N; 
similarly, the nonpoissonian overall multiplicity of UA1 data is known to be 
the source of nonzero C2(f2tot)uAi — 0.4. 

For the stronger condition (^) to be fulfilled, the uncorrelated sample 
needs to be a realisation of the Poisson process, for which both P^ is poisso- 
nian and all possible subdivisions of phase space are both mutually uncorre- 
lated and poissonian in themselves. 



3.2 Normalisation 

Normalisation is designed to get rid of the trivial dependence on the overall 
multiplicity (leaving nontrivial multiplicity-driven effects) and the shape of 
the one-particle distribution. Both trivial multiplicity dependence and shape 
are eliminated by the "vertical" or differential normalisation pi{x{) ■ ■ ■ px(x q ), 
which is commonly held to be appropriate because for uncorrelated samples 
Pq — * p\ • • • Pi an d so 

/ \ Pqi^l: • • ■ ) •Eq) statistical independence /^x 
r q {X\, . . . , Xq) — t - 1 . (DJ 

pi{xi) ■ ■■pi{x q ) 

Again, however, this normalisation turns out to be incorrect for any condi- 
tioned sample; again, the implicit assumption being made in writing p q — > p\ 
is that the uncorrelated case is the Poisson process, which is clearly impos- 
sible once conditioning changes P*,. Most experimentalists are intuitively 
aware of this; for example, no central collision sample is ever normalised with 
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poissonian but strictly by one constructed with the same experimental 
non-poissonian governing the numerator.^] 

3.3 The reference distribution 

We have seen that textbook definitions of moments, cumulants and normali- 
sations mathematically equate uncorrelatedness with the Poisson process. We 
also know that the majority of samples are not compared to the Poisson pro- 
cess at all; on the contrary, all sorts of nonpoissonian cases are considered 
trivial or, in our parlance, definitive of "uncorrelatedness" . 

Moreover, different physics questions addressed to the same data sam- 
ple may consider different parts of its correlation structure to be trivial and 
nontrivial: some questions need to eliminate kinematical constraints, some 
do not; some theorists want to eliminate the effect of resonances in order 
to isolate the "quantum statistics" , while others need to eliminate quantum 
statistics to look for dynamical effects. One approach wants to get rid of jet 
effects, another needs them explicitly. Different jet-finding algorithms will 
yield differently conditioned data samples. And so on. 

Clearly, in practice there is not one exclusively valid definition of the 
trivial, uncorrelated sample, but as many different ones as there are interesting 
experimental questions to ask. Explicit consideration and definition of the 
appropriate reference distribution, that distribution considered by the physics 
at hand to be trivial, is therefore not a luxury but the very foundation of 
making any meaningful statement on correlation itself. 

Once this is accepted, it follows that all measurement quantities (cu- 
mulants, normalisation, . . . ) must also be tailored to the appropriate ref- 
erence distribution. For cumulants, the tailor's job turns out to be sim= 
pie: based on discrete Edgeworth expansions, it can be shown rigorously u 
that any process / has cumulants with respect to any reference process h of 
C g (/with respect to h) = C q (f) — C q (h). These "generalised cumulants" re- 
duce to the normal ones when h is the Poisson process, and can be viewed as 
the deconvolution of / with respect to h, just as eq. (^) represents the convo- 
lution of / with h. For experimental cumulant definitions, h would be defined 
by the desired reference distribution, and the tailormade cumulant would be 

^nontrivial effect pdata preference 

With regard to normalisation, it is conceptually important that the appropri- 
ate random variable in g-th order phase space is a g-tuple (Xf t , Xf 2 , . . . , Xf ) 

e This takes care, however, only of the overall nonpoissonian character, but not of internal 
correlation issues. 
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( Nnontrivial effect _ Fqy^li ■ ■ ■ > ^ql /~\ 

f q {J,l,...,X g ) ~ „ („ „ n reference ' V > 



rather than q individual particles Xf. The distinction lies exactly in the fact 
that a g-tuple is easily conceived of as being drawn from a correlated ref- 
erence sample, while q individual particles tend to be viewed as (poisson) 
uncorrelated. The normalisation (|^) is hence to be replaced by 

^nontrivial effect _ Pq{ x li ■ ■ ■ > X q) dllt 
Pq{xi, ...,X q ) 

which by construction tends to unity when the data and reference moments 
are the same. Examples for preference are ^ ne usua j ^^^data , . . ^^^data f or 

the Poisson case, and -^^pi(a;i|iV) data • ■ ■ pi(xq\N) data for the multinomial, 
the appropriate reference distribution for fixed-multiplicity samples. More 
generally, when Monte Carlos are used to simulate what is considered trivial 
or uninteresting for a particular data set and physics question, the appro- 
priate reference moment is preference = p q ( Xu . }Xq )MC, trivial effects _ Thig 

corresponds exactly to common practice. 

Although there is no mathematical theorem for this, the natural normal- 
isation for cumulants would seem to be the same as that for moments, so 

pdata preference 

KJxi,..., z„) nontrivial cffoct = -2 j-2 . (8) 

q y A - 1 1 q> ^reference v ' 

I q 

Obviously, the identical phase space integral J„ needs to be applied to both 
numerator and denominator whenever phase space averaging is applied. 

Two applications will illustrate [the points made: the extraction ofinter- 
W + W~ Bose- Einstein correlations □ currently much under discussion u and 
the practice of "double normalisation" . In the former case, one regards 
Bose-Einstein correlations originating separately within the W + and W~ 
jets, measured in semileptonic (si) decays, to be trivial; hence Cj = 
£,be m si(+)jet _j_ ^,be m si(-)jct^ ancl go ^ e nontrivial, inter-M^VF correlations 
in the fully hadronic qqqq sample would according to (^) be 

pintcr — WW _ q,BE pairs in qqqq sample _ pBE in sl(+)jct _ £> BE in sl( — )jct ^g-^ 

Second, doubly normalised cumulants, generically of the form 

( „data / „data „data"\ 
\H2 I Pi Pi 



{ P f C /pf C pf^ 



1, (10) 



have often been applied heuristically, precisely with the intention of ridding 
the data of trivial but correlated reference distributions. Now, from (||) we 
get a nontrivial cumulant of 

/^data nMC .data „MC „MC „MC _i_ _data _data 

°2 ~ °2 _ 92 - Pi — Pi Pi + Pi Pi 

„MC „MC ' V I 

P2 P2 
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which can be reduced to the form ( |io| ) if and when pi(x) dauta = pi(x) MC 
for all x. Conventionally used double ratios are thus in agreement with our 
derivations to the extent that this condition is satisfied. For higher-order 
cumulants q > 3, however, double ratios will always differ from the "exact" 
cumulant subtraction method. n 

We mention in passing that the "statistical fluctuations" of Ref. □ are 
nothing but its reference distribution, (which, as always, does not define it- 
self but must be defined) , while the <Xg tat is the cumulant which the authors 
subtract out, in agreement with (^). 



4 Event-by-event analysis and the sampling hierarchy 

Rather than attempting an overly condensed summary of experimental results 
and issues in general,^] we now concentrate on so-called event-by-event (or 
"EbE" ) analysis which has gained increasing prominence in recent years. Our 
limited goal is to show that, as far as the statistics are concerned, approaches 
and results called EbE in the literature fall into three distinct classes, namely 
true event-by=event observables, sample statistics, and conditioned statistics. 
See also Ref. U for another classification scheme. 

To understand the significance and interrelationship between these 
classes, we describe briefly an example of a sampling hierarchy. At the basis 
of the hierarchy is the set of single track measurements, grouped into events 
making up the inclusive data sample, from which the relevant sample statistics 
can be extracted. Many samples make up a sample of samples or metasam- 
ple, yielding in turn sampling statistics. Further grouping of metasamples 
and metametasamples can clearly be continued ad infinitum, at least concep- 
tually, hopefully asymptotically approaching what a scientist might accept as 
"statistical truth" . 

In Fig. 4, this hierarchy and its related correlation structure is shown.^] 
Starting in Box f from per-event counters, the desired attribute C e ,e = 
I, . . . , A/"cv is measured per event. Direct event averaging (Boxes 2 — > 4) over 
C is convenient but also immediately loses most information. One therefore 
additionally constructs the frequency distribution, the relative frequency of 
events for which observable C takes on a particular value, 

P{C) = Pf, = A/ ~ ev(C) „ . (12) 
C EcAA cv (C) 



^The interested reader should consult existing reviews such as Refs. Q. 

9 There are, of course, other hierarchies besides the one shown, including multivariate forms 
and those where event averaging precedes phase space averaging. 
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This opens up (Box 3) the world of sample statistics, the simplest of which is 
P(C) itself. Much more information can thereafter be gleaned from higher- 
order sample statistics J^c ^c^^) w ^h various functions h. The case h(C) 
( ' leads us back (3 — > 4) to the sample mean. 

Similarly, direct averaging of sample statistics (4 — > 6) is complemented 
by constructing the sampling distribution 7-* ((C)), the relative frequency of 
samples with a particular value for attribute (C), which then forms the basis 
for sampling statistics (Box 5). 

Conditioning enters the picture when a subsample or set of subsamples is 
selected by some criterion Iq- Each resulting conditioned sample distribution 
P(C | Ic) in Box 7, with the appropriate subsample Sj c C S, then forms 
the starting point for a hierarchy of its own, with a corresponding set of 
conditioned statistics and averages. 

Popular EbE schemes can now be seen as falling into three distinct classes: 

I True per-event observables: These look directly at one event only, 

considering, for example, statistics of that event's internal structure such 
as per-event factorial moments □ the wavelet-transformed density E3 
and per-event slope parameters. Wavelet transforms are of particular 
interest, because using the wavelet as weight function / in cq. (jl|) goes 
well beyond the simple step functions or prefactors mostly in use. 

Averages within this class are necessarily restricted to phase space ( "spa- 
tial", "horizontal") averages; this includes any sums over particles. Any 
attempted differential or "vertical" normalisation will necessarily rely on 
averages over a conditioned subsample having the same property as the 
event under consideration, but external to the event itself. Needless to 
say, such mixed cases must be handled with particular care. 

Cumulants, generally reliant on event averages, are not easily defined on 
a per-event basis. Again, relative to some sample, this can be done, □ 
but only with special care. 

II Sample statistics: Quantities in Ref. and similar work are typical 

sample statistics; for example, the observable 

4>X>. ^ E P <^)>A 2 - ( E P ( P ,)M>) (13) 

is the second cumulant of the sample distribution in (p±) e , the mean 
transverse momentum of tracks in event e. 
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Clearly, such quantitites are not truly event-by-event but sample-based; 
the misnomer is probably based on the fact that Pq is an event ratio. 
Sample statistics as such are nothing new in the sense that multiplicity 
distributions, scatter plots of events Ii3 and generally observables based on 
relative frequencies of events-have been in the experimentalist's toolbox 
for a long time. New in Ref. til and similar ones is the fact that these either 
go beyond first-order sample statistics or have invented novel attributes 
C . Neither their ancestry nor their misclassification detracts from the 
interest and relevance of these quantities. 

Ill Conditioned statistics: This class can be termed EbE in that the pro- 
cess of conditioning selects or discards events according to event prop- 
erties—with, again, C a fixed multiplicity, a trigger based on a single 
tracktJ, a g-tuple (e.g. invariant mass), the whole event (total transverse 
energy) etc. After selection, however, conditioned measurements can vary 
as much in character as unconditioned ones, ranging from true Class I 
EbE measurements to (conditioned) sample statistics. As in Class II, the 
novelty lies chiefly in exploiting higher-order sample statistics. 

Nothing forbids the use of observables higher up in the sampling hierarchy; on 
the contrary, the sampling distribution in Box 5 forms the basis for any esti- 
mate of the dispersion of sample statistics ( "Were I to repeat my experiment 
many times, how often would my currently measured results be matched?"). 
Examples are Ref. 113, where sampling statistics of factorial moments were 
calculated in order to estimate the reliability of single-sample moments, and 
standard variances of means, which are just the q=2 cumulants of the sam- 
pling distribution. 

5 Conclusions 

Very briefly: The first part of this review intended to show that most corre- 
lation measurements can be understood as various phase space integrations 
and event averages over per-event counters p q . Through selection of Q and 
Si c , one zooms in on various aspects of the overall reaction. 

After discovering the Poisson process lurking as an unwarranted assump- 
tion in the normal definitions of cumulants and normalisation, we found solace 
in the existence of generalised cumulants and normalisations which appear 
successfully to allow for and accommodate any desirable reference distribu- 
tion. 

The current flurry of event-by-event analysis turns out, on closer inspec- 
tion, to consist of three different classes, namely true event-by-event quanti- 
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ties, sample statistics, and conditioned sample statistics. For clarity in this 
and most questions of experimental statistics, the appropriate sampling hier- 
archy is an indispensable navigation instrument. 
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Figure 4. Example of a simple univariate sampling hierarchy, Version 1.1 
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